Naive Credal Classifier 2: a robust approach to classification for small and incomplete data sets

Authors

  • Giorgio Corani
  • Marco Zaffalon
Abstract

The Naive Credal Classifier, an imprecise-probability counterpart of Naive Bayes, is rigorously extended to a very general and flexible treatment of incomplete data, yielding a new classifier called Naive Credal Classifier 2 (NCC2). The new classifier delivers classifications that are robust to small sample sizes and missing values. In particular, empirical evaluations show that, by issuing set-valued classifications, NCC2 is able to isolate and properly deal with instances that are hard to classify (on which Naive Bayes’ accuracy drops considerably), and to perform as well as Naive Bayes on the others. The experiments also point to a more general problem: with missing values, empirical evaluations may not reliably estimate the accuracy of a traditional classifier such as Naive Bayes. This appears to add even more value to the robust approach to classification implemented by NCC2.

Keywords: Naive Bayes; Naive Credal Classifier; Imprecise Probabilities; Missing Values; Conservative Inference Rule; Missing At Random.
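To make the idea of a set-valued classification concrete, the following is a minimal sketch and not the NCC2 algorithm itself. It assumes probability intervals from the imprecise Dirichlet model with a hypothetical hyperparameter `s`, multiplies the interval bounds independently (a conservative outer approximation), and uses simple interval dominance in place of the dominance test a credal classifier would actually apply. Missing values, written as `None`, are simply skipped, which corresponds to a missing-at-random assumption rather than to the conservative treatment named in the keywords. The function names `idm_interval` and `credal_predict` are illustrative only.

```python
from collections import Counter, defaultdict


def idm_interval(count, total, s=1.0):
    """Lower and upper probability from counts, with prior weight s (assumed)."""
    return count / (total + s), (count + s) / (total + s)


def credal_predict(rows, labels, instance, s=1.0):
    """Return the set of classes not interval-dominated by another class.

    Simplified illustration: bounds on prior and conditionals are multiplied
    independently, and None (missing) feature values are ignored.
    """
    class_counts = Counter(labels)
    n = len(labels)

    # (class, feature index) -> counts of observed feature values
    feat_counts = defaultdict(Counter)
    for row, c in zip(rows, labels):
        for j, v in enumerate(row):
            if v is not None:
                feat_counts[(c, j)][v] += 1

    bounds = {}
    for c, nc in class_counts.items():
        lo, hi = idm_interval(nc, n, s)
        for j, v in enumerate(instance):
            if v is None:  # missing in the instance: skip this feature
                continue
            counts = feat_counts[(c, j)]
            l, u = idm_interval(counts[v], sum(counts.values()), s)
            lo *= l
            hi *= u
        bounds[c] = (lo, hi)

    # Keep a class unless some other class has a lower bound above its upper bound.
    return {
        c
        for c, (_, hi) in bounds.items()
        if not any(o_lo > hi for o, (o_lo, _) in bounds.items() if o != c)
    }


if __name__ == "__main__":
    big_rows = [("a",)] * 4 + [("b",)] * 4
    big_labels = ["x"] * 4 + ["y"] * 4
    print(credal_predict(big_rows, big_labels, ("a",)))      # enough data
    print(credal_predict([("a",), ("b",)], ["x", "y"], ("a",)))  # tiny sample
    print(credal_predict(big_rows, big_labels, (None,)))     # missing feature
```

With eight training rows the sketch returns a single class, while with only two rows, or with the feature missing, the intervals overlap and both classes are returned. This mirrors, in a much cruder form, the behaviour the abstract describes: a determinate answer when the data support it and a set of classes otherwise.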

Similar resources

JNCC2: An extension of naive Bayes classifier suited for small and incomplete data sets

JNCC2 implements the Naive Credal Classifier 2 (NCC2), i.e., an extension of naive Bayes to imprecise probabilities, designed to return robust classification even on small and/or incomplete data sets, which is often the case in environmental case studies.

Learning Reliable Classifiers From Small or Incomplete Data Sets: The Naive Credal Classifier 2

In this paper, the naive credal classifier, which is a set-valued counterpart of naive Bayes, is extended to a general and flexible treatment of incomplete data, yielding a new classifier called naive credal classifier 2 (NCC2). The new classifier delivers classifications that are reliable even in the presence of small sample sizes and missing values. Extensive empirical evaluations show that, ...

Reliable diagnoses of dementia by the naive credal classifier inferred from incomplete cognitive data

Dementia is a serious personal, medical and social problem. Recent research indicates early and accurate diagnoses as the key to effectively cope with it. No definitive cure is available but in some cases when the impairment is still mild the disease can be contained. This paper describes a diagnostic tool that jointly uses the naive credal classifier and the most widely used computerized syste...

JNCC2 user manual and tutorial (rev 1)

This paper introduces JNCC2, the Java implementation of the Naive Credal Classifier 2 (NCC2). JNCC2 is open source; it is hence freely available together with manual, sources and javadoc documentation. NCC2 is an extension of Naive Bayes Classifier (NBC) to imprecise probabilities, designed so as to return robust classifications even on small and/or incomplete data sets. A peculiar feature of N...

JNCC2: the Java implementation of the Naive Credal Classifier 2

This paper introduces JNCC2, the Java implementation of the Naive Credal Classifier 2 (NCC2). JNCC2 is open source; it is hence freely available together with manual, sources and javadoc documentation. JNCC2 implements the Naive Credal Classifier 2 (NCC2), i.e., an extension of Naive Bayes Classifier (NBC) towards imprecise probabilities. NCC2 is designed to return robust classification, even on ...

Publication date: 2007